Data modeling as a main source of discrepancies in single and multiple marker association methods

نویسندگان

  • Mônica Corrêa Ledur
  • Nicolas Navarro
  • Miguel Pérez-Enciso
چکیده

BACKGROUND Genome-wide association studies have successfully identified several loci underlying complex diseases in humans. The development of high density SNP maps in domestic animal species should allow the detection of QTLs for economically important traits through association studies with much higher accuracy than traditional linkage analysis. Here we report the association analysis of the dataset simulated for the XII QTL-MAS meeting (Uppsala). We used two strategies, single marker association and haplotype-based association (Blossoc) that were applied to i) the raw data, and ii) the data corrected for infinitesimal, sex and generation effects. RESULTS Both methods performed similarly in detecting the most strongly associated SNPs, about ten loci in total. The most significant ones were located in chromosomes 1, 4 and 5. Overall, the largest differences were found between corrected and raw data, rather than between single and multiple marker analysis. The use of raw data increased greatly the number of significant loci, but possibly also the rate of false positives. Bootstrap model aggregation removed most of discrepancies between adjusted and raw data when SMA was employed. CONCLUSION Model choice should be carefully considered in genome-wide association studies.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Single Nucleotide Polymorphisms and Association Studies: A Few Critical Points

Uncovering DNA sequence variations that correlate with phenotypic changes, e.g., diseases, is the aim of sequence variation studies. Common types sequence variations are Single nucleotide polymorphism (SNP, pronounced snip).SNPs are the third-generation molecular marker. SNP represents a DNA sequence variant of a single base pair with the minor allele occurring in more than 1% of a given popula...

متن کامل

بررسی ارتباط پلی‌مورفیسم rs1800624در ژن RAGE با بیماری مولتیپل اسکلروز در اصفهان

Introduction: Multiple sclerosis (MS) is an acute disease of the central nervous system (CNS) associated with the degradation of myelin sheet around the nerve cells. It is assumed to be a multifactorial disorder that is to say numerous environmental and genetic factors are involved in the disease. Therefore, this study aimed to investigate the association between rs1800624 single nucleotide pol...

متن کامل

Selection of Variables that Influence Drug Injection in Prison: Comparison of Methods with Multiple Imputed Data Sets

Background: Prisoners, compared to the general population, are at greater risk of infection. Drug injection is the main route of HIV transmission, in particular in Iran. What would be of interest is to determine variables that govern drug injection among prisoners. However, one of the issues that challenge model building is incomplete national data sets. In this paper, we addressed the process ...

متن کامل

Bayesian Sample Size Determination for Joint Modeling of Longitudinal Measurements and Survival Data

A longitudinal study refers to collection of a response variable and possibly some explanatory variables at multiple follow-up times. In many clinical studies with longitudinal measurements, the response variable, for each patient is collected as long as an event of interest, which considered as clinical end point, occurs. Joint modeling of continuous longitudinal measurements and survival time...

متن کامل

Flood Hydrograph Analysis Through Employing Physical Attributes Using Two and Multiple Variables Regression Factor and Cluster Analysis

Since direct experimental evidence is not available, this must be verified through a modeling approach, providedadequate data be available. Many statistical methods are used to study the relation between independent anddependent variables.This research was carried out at the western part of Jazmurian basin tlocated in the southeast ofIran. In this paperused ten physical characteristics such as ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • BMC Proceedings

دوره 3  شماره 

صفحات  -

تاریخ انتشار 2009